vae training and autovae #11592

linnanwang · 2024-12-13T23:55:55Z

What does this PR do ?

this PR introduces VAE training into NeMo that allows users to customize a 16x or 8x reduction VAE on their in house data. Besides, we also introduce a feature that automatically designs VAE based on the GPU memory and latency requirements.

Collection: NeMo/Diffusion/VAE

Changelog

it is a new feature, no touch of existing codes.

Usage

Please follow the readme to setup the training.

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

[Yes] Make sure you read and followed Contributor guidelines
[No] Did you write any new necessary tests?
[Yes] Did you add or update any necessary documentation?
[No] Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
- [No] Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

New Feature

Signed-off-by: linnan wang <[email protected]>

nemo/collections/diffusion/vae/contperceptual_loss.py

+
+        self.discriminator = NLayerDiscriminator(
+            input_nc=disc_in_channels, n_layers=disc_num_layers, use_actnorm=use_actnorm


Signed-off-by: linnan wang <[email protected]>

pablo-garay · 2024-12-31T16:16:52Z

CI passed: https://github.com/NVIDIA/NeMo/actions/runs/12553124376

linnanwang · 2024-12-31T19:20:47Z

@pablo-garay @ethanhe42 thanks!

* vae training Signed-off-by: linnan wang <[email protected]> * vae training Signed-off-by: linnan wang <[email protected]> --------- Signed-off-by: linnan wang <[email protected]> Signed-off-by: Abhinav Garg <[email protected]>

vae training

2fb3901

Signed-off-by: linnan wang <[email protected]>

ethanhe42 self-requested a review December 14, 2024 01:04

ethanhe42 previously approved these changes Dec 14, 2024

View reviewed changes

github-advanced-security bot found potential problems Dec 14, 2024

View reviewed changes

vae training

937b2ff

Signed-off-by: linnan wang <[email protected]>

linnanwang dismissed ethanhe42’s stale review via 937b2ff December 14, 2024 03:26

ethanhe42 approved these changes Dec 16, 2024

View reviewed changes

Merge branch 'main' into main

0f0977c

linnanwang changed the title ~~vae training~~ vae training and autovae Dec 16, 2024

ethanhe42 enabled auto-merge (squash) December 30, 2024 17:14

ethanhe42 added the Run CICD label Dec 30, 2024

Merge branch 'main' into main

2eb01ce

ethanhe42 added Run CICD and removed Run CICD labels Dec 30, 2024

Merge branch 'main' into main

8c987c0

pablo-garay disabled auto-merge December 31, 2024 19:18

pablo-garay merged commit b73bfff into NVIDIA:main Dec 31, 2024
14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vae training and autovae #11592

vae training and autovae #11592

linnanwang commented Dec 13, 2024 •

edited

Loading

pablo-garay commented Dec 31, 2024

linnanwang commented Dec 31, 2024


		self.discriminator = NLayerDiscriminator(
		input_nc=disc_in_channels, n_layers=disc_num_layers, use_actnorm=use_actnorm

vae training and autovae #11592

vae training and autovae #11592

Conversation

linnanwang commented Dec 13, 2024 • edited Loading

What does this PR do ?

Changelog

Usage

GitHub Actions CI

Before your PR is "Ready for review"

pablo-garay commented Dec 31, 2024

linnanwang commented Dec 31, 2024

linnanwang commented Dec 13, 2024 •

edited

Loading